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Information fusion from multiple sources is a critical requirement for System Wide 
Information Management in the National Airspace (NAS). NASA and the FAA envison 
creating an “integrated pool” of information originally coming from different sources, which 
users, intelligent agents and NAS decision support tools can tap into. In this paper we 
present the results of our initial investigations into the requirements and prototype 
development of such an integrated information pool for the NAS. We have attempted to 
ascertain key requirements for such an integrated pool based on a survey of DSS tools that 
will benefit from this integrated pool. We then advocate key technologies from computer 
science research areas such as the semantic web, information integration, and intelligent 
agents that we believe are well suited to achieving the envisioned system wide information 
management capabilities. 


L Introduction 

C onsider a situation where bad weather around a particular airport requires that the controllers suspend landing 
and take-off of any aircraft from that airport for a 2 hour period. All flights scheduled to land into or take-off 
from this airport in that specified period thus need to be rerouted or rescheduled. Naturally this disruption causes 
flight delays, causes passengers to miss connections and may result in additional expenses for the airlines and the 
FAA..Can we deal efficiently and optimally with such situations i.e., do the required scheduling and rerouting by 
minimizing the flight delays and missed connections and also minimizing the additional financial costs to the 
airlines and authorities ? This is precisely the goal of the NASA FAA System Wide Information Management 
(SWIM) initiative. The SWIM initiative proposes to have various distributed information sources and services in 
National Airspace (NAS) connected seamlessly so that end users such as air-traffic controllers, other decision 
makers and decision aids such as decision support tools (DSS) or intelligent agents have integrated access to all the 
information needed to make informed decisions in the above kinds of situations. In this paper we present the results 
of our initial investigation into a critical aspect of SWIM which is the integration issue. Seamless integrated access 
needs to be provided to the various distributed, heterogeneous and autonomously created and maintained 
information sources and services in the NAS. One of our thrusts has been to ascertain the information integration 
requirements in the NAS domain by talking to various potential stake-holders and familiarizing ourselves with the 
information access needs in the NAS. The other thrust has been to then evaluate what concepts, technologies, 
approaches and systems we can take from the state-of-the-art in computer science and information management 
research to apply to the information integration problem in the NAS. We present our findings on the above in the 
following sections concluding with recommendations for further activity. 


II. Survey of DSS Tools and Information Access Requirements 

It is clearly stressed that it is the NAS DSS tools that would benefit greatly by having access to an integrated pool of 
information rather than accessing the information through multiple stovepipe connections to the various information 
sources and services they need to access. We thus set for ourselves the following goals to better understand the 
domain. 

• Obtain an understanding of NAS DSS tools, especially from the perspective of their requiring integrated access 
to multiple data sources. 
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• Determine the various types of data sources that are used by DSS tools and that need to be integrated into the 
envisioned SWIM information pool. 

We started by interviewing a number of personnel in the NASA Ames AATT Office who are intimately familiar 
with such NAS DSS tools and have actually developed some of these tools in the past. We conducted personal 
interviews and also surveyed a number of Operational Concept Documents (OCDs) for these DSS tools as advised 
by them. Some of the key information that we have ascertained is as follows: 

• Information in the NAS comes from a wide variety of information sources and is of different kinds. 

The information could be time referenced information, geographically referenced information (such as airport 
locations, terrain information etc), weather information, air-traffic hazard information (bird activity), navigation 
system information, flight information, safety information (accident statistics etc) and other business 
management information. 

• There are different kinds of systems providing and accessing information 

Namely flight service station systems, tower Systems, terminal Systems, enroute systems, oceanic Systems, 
TFM (Traffic Flow Management) systems, and security systems. 

• DSS tools fall into various categories based on the high level problems they were designed to address 

The high level goals for the future NAS include providing emerging situational awareness, collaborative decision 
making, and moving towards “free-flight” concepts. The key problems in the NAS that they are targeted top 
address are: 

1. Airspace organization and management. 

2. Demand capacity and balancing. 

3. Traffic synchronization, and 

4. Separation management . 

DSS tools fall into the following 7 broad categories: 

(a) Surface DSS 

(b) Terminal DSS 

(c) Enroute DSS 

(d) Oceanic DSS 

(e) TFM DSS 

(f) Infrastructure Management DSS 

(g) Flight Planning DSS 

With guidance from NASA Ames AATT Office personnel, the AIMS team surveyed a number of Operational 
Concept Documents (OCDs) for NAS DSS Tools. Some of the relevant tool descriptions we surveyed were: 
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DSS Tool 


Description and capabilities 


Surface Management NASA FAA collaboration to develop a DSS 
System (SMS) tool - SMS (NASA 2003) that helps controllers 

and air carriers collaboratively manage the 
movements of aircraft on the surface of busy 
airports, thereby improving capacity, efficiency, 
and flexibility. 


Information Sources/Information 
Accessed 

(a) ETMS t 

- Planned pushback times, flight 
plans, EDCTs (Expected Departure Clearance 
Time) 

(b) Surface Surveillance 

(c) Airport configuration, Miles-in- 
trail (National Log Program) 

(d) Ramp Tower 


Traffic 

Management (TFM) 


- Parking gates, pushback events, 
ready to push status, flight priorities 


Flow 


Air traffic management is the strategic 
management of traffic flow to minimize delays 
and congestion while maximizing the overall 
throughput of the NAS. The goal of TFM is to 
provide strategic planning and management of 
air traffic demand to ensure smooth and 
efficient traffic flow through FAA-controlled 
airspace. 


(a) ETMS 

- GS/GDP parameters, schedules and 
flight plans, SUAs, radar tracks, etc. 

(b) ATCSCC (Air Traffic Control 
System Command Center) 

- Routes, advisories, equipment status 

- Demand rate, alert thresholds, reroutes, 
capacity 

(c) ARTCCs TMU (Air Route 
Traffic Control Center, Traffic 
Management Unit) 

- NAS messages, flow rates, thresholds 
etc. 


(d) AOCs (Airline Operations 
Control Center) 


Surface Movement The SMA, a joint development between the 
Advisor (SMA) FAA and NASA, is designed to assist in airport 

surface operations. SMA electronically connects 
ATC, airline, and airport operations information 
to facilitate information sharing and improve 
taxi queuing. Further integration is also required 
with ground tracking systems. 


- Flight data messages, cancellations, 
substitutions, candidate flights 

(a) Controller inputs from the ramp 
and FAA towers 

(b) Live terminal area data from the 
FAA’s HOST computer system 

(c) Airline schedules 

(d) Estimated pushback times. Gate 

assignments 


The reader is referred to http://www.as.nasa.gOv/aatt/research.html#72 where a number of OCDs describing the 
above mentioned and several other tools are available. 


(e) Technologies 

The information sharing and integration problem is ubiquitous and indeed the computer science and information 
management research community as well as the software industry have contributed approaches, concepts, 
techniques, tools and products towards addressing the problem or aspects of it Some of the technical areas that are 
highly relevant include information integration systems, intelligent agents, semantic web technologies such as mark- 
up languages and ontologies, information retrieval, etc. While we cannot go into detailed introductory descriptions 
of these technologies we would like to point the reader to (Knoblock et aL 2001) for an excellent introduction to 
intelligent information agents and to (Berners-Lee et al. 2001) for an introductory tutorial of the concept of the 
semantic web. We now present our evaluation of technologies in these areas for the SWIM information extraction 
and access needs. 


1 Enhanced Traffic Management System (http://hf.tc.faa.gov/proiects/etms.htm ) 
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A. Information Synthesis 

Within the rapid evolution of the information age we find the view of an intelligent agent as a reasoning, or highly 
knowledgeable specialist is giving way to a new view. This view is that agents are more of a programming 
framework that may use the aggregation, filtering, cataloging and other information transformation services 
currently emerging. A couple of examples are Google’s search services and IBM’s Webfountain project that will 
offer deeper analysis of unstructured data, such as the internet, than is possible now. [Reference: A Fountain of 
Knowledge, Stephen Cass, IEEE Spectrum, Jan 2004, pp 68 to 75.] As a consequence of our research, we divide up 
the “intelligent agent technology pie” into Model Based Synthesis and Model Based Technology Support. 

The reason we have done this is that we feel the “higher ground” to be reached in the next several decades is to have 
information systems offer up a completely new order of information processing where SWIM’s agents play a role in 
constructing models dynamically and reasoning about them in order to augment the information used in the decision 
making processes of pilots, air traffic controllers, etc. Much of this synthesized information will come from a 
number of unstructured, semi -structured data sources. 

1. Model Based Synthesis 

An important problem when dealing with information from multiple sources is that of information synthesis. For 
instance, in a certain situation, we may want reports or descriptions of previous occurrences of that situation (for 
instance say a bad weather situation which forces aircraft rerouting). We will build on our ongoing synergistic 
efforts on intelligent text search and mining in the aviation safety domain, where we developing techniques for 
intelligent synthesis of semi-structured aviation safety related information residing in multiple sources. We are 
developing techniques to enable search and synthesis in a significantly more sophisticated manner that is currently 
feasible by keyword-based search engine technology and information retrieval (IR) systems. Beyond search, we are 
also investigating techniques to identify and search for behaviour and incident patterns in textual descriptions of 
aviation related incidents or accidents. Amongst other information, document, textual and semi-structured 
information will also be important in our application domain, hence we would be addressing the information 
synthesis issues for such information in this effort. 
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We must also mention highly relevant in-house technology in this area, the NETMARK data management system 
which is a system for integrating and managing semi-structured data, that has been developed by the AEN Infolab at 
NASA Ajnes Research Center. NETMARK has already been used for several important NASA applications and is a 
promising technology for addressing some of the important TSWIM objectives as well. 

2. Model Based Technology Support 

Intelligent agent technology use has been growing significantly in industrial enterprises to the point that this 
technology is becoming main stream and in many ways the distinction between agents and portions of an operating 
system, middle layer component or application plug-in is being blurred. As a consequence, agent technology may 
be viewed as a programming framework that supports intelligent mechanisms based on algorithms and techniques 
found in artificial intelligence. With the advent of the Internet age the need to deal in semantics, i.e. ontologies, in 
order to manage die huge explosion of information confronting enterprises the role of agents is more of consumers 
of semantic information, i.e. metadata than as the main custodians of it, which was the predominate case just a few 
years ago. This is not to say that significant progress is still under way in making agent technology an order of 
magnitude more effective. There are several areas that make or break an agent framework’s suitability. 

The first one is agent habitat By habitat we mean the services and environment that the agent framework must have 
in order to operate. Closely related is the agent’s framework capable to use feature in its habitat to support its 
existence and to fulfill is role or behaviors. This boils down to the richness of services included within the 
framework to deal the kind of computing environment in which the agents are embedded or meant to service. 

The second important suitability factor for an agent framework is the richness of its internal representation of 
knowledge. Knowledge representations include specialized languages that support first and higher order logic 
constructs and may include some sort of object and its attributes and properties representation. The issue is whether 
the agent’s internal representation of knowledge is sufficient to represent the domain under consideration and how 
easy it is to use and maintain. 

Both of these suitability factors as well as others will be investigated in greater detail in phase 2. 


B. Agent and Grid Technologies Survey 

Along with the requirements analysis, we also initiated the survey and evaluation of several relevant technologies in 
the agents, grid and information mediation domains. At this stage we have identified for consideration certain 
relevant technologies, systems and frameworks; a detailed evaluation and appropriateness determination will be 
completed later. 

The technologies and systems under consideration so far include: 

1. SoFAR: An agent framework for distributed information management.(Luc Moreau 2000) 

This is a multi-agent framework designed for distributed information management tasks. Has 
research group at the University of Southampton and is available free for research use. 

2. Fred Agent Platform (http://www.netdvnamics-tech.com) 

This is a commercial technology developed by Net Dynamics Internet Technologies GmbH 
agent platform that enables automated information processing by agents, there is also strong 
web technology and ontologies in this system. 

3. Agent Framework for Grid Computing (Majid Ali Khan 2003) 

An agent based system for grid computing developed by the University of Central Florida. 

4. KAON (http://kaon.semanticweb.org) 

An open-source ontology management infrastructure targeted for business applications, developed by the University 
of Karlsruhe. 

5. KaoS (Bradshaw 1996) 

An distributed agent development and communication system developed by Boeing and the University of West 
Florida. Has been used in previous information integration efforts at NASA Ames. 


been developed by a 


and Co KG. It is an 
support for semantic 


C. Ontologies and Semantic Integration 
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In the first phase of this task we have started evaluation of ontology and semantic integration issues to be dealt with 
as part of the development of AIMS agents and to seek out tools that may aid us in the development of an AIMS 
ontology base. 

The capability of information systems to process information based on semantics, i.e. meaning is a significant 
challenge; one that is considerably more demanding than mere key, index or textual pattern comparison. The reason 
for this is that semantic meaning depends on context. To narrow the semantic interpretability space the use of 
ontologies has come into vogue as a means to more precisely limit the space of possible interpretations, i.e. context; 
the goal for any given semantic unit within a vocabulary, i.e. ontology, is to have its meaning be resolvable with no 
ambiguity. While this may be accomplished within a narrow domain with the use of formal methods there remains 
the issue of information synthesis across multiple ontologies. There is much work under way on this issue. For 
example see [Maedche] for a tool to manage distributed (distinct) ontologies collectively. This problem does not 
negate an enterprise’s need to invest in the development of its own ontologies and to invest into integration tools for 
advancing its decision support systems but to be prepared to incorporate the use of semi-structured and unstructured 
data as part of its ontology portfolio. Emerging technologies such as NASA’s NETMARK and IBM’s Webfountain, 
mentioned earlier, address the use of this kind of information. The challenge is to incorporate these as part of an 
enterprise’s collective knowledge base. 

D. Security and Access Control Issues 

A major practical problem in data sharing and integration across divisions and departments in an enterprise or across 
multiple enterprises (as will be the case with SWIM) is that of security and access control for the data being shared 
and integrated. While we can insert data integration technology to facilitate data sharing and integration we also 
need to ensure that all access control policies for the different pieces of data are indeed being respected. The 
database research has recently begun to look into this problem (Clifton et al. 2004). Also the industry appears to 
have made progress too, for instance we have had discussions with Composite Software Inc. a data integration 
company that claims to have some solutions to the security and data access issues in a data integration environment. 


(f) Particular Problems and Operating Realities in the NAS 

Meetings with senior personnel at the AATT Office at NASA Ames provided us with a valuable introductory 
insight into some of the non-technical issues and challenges peculiar to the NAS domain, as well as some of the 
realities of how the airspace is currently managed. As regards information and data, the same data will likely be used 
by different tools for different tasks and by different roles (personnel). The timescale requirements of the data will 
likely be very different for different applications. Further, the spatial scale and granularity of data required will be 
different for different applications. Also the level of detail required can be quite different. 

Uncertainty is another big issue in the NAS today. For instance decisions could be taken based on certain 
assumptions that may not be true (for instance a decision is taken with the assumption that a particular flight will 
take off at 5pm but it does not). Information may not be completely shared (for instance a pilot has certain 
information that he does not convey to potentially relevant parties). Further there could be deviations such as an 
aircraft not being on its filed flight plan (which we are told occurs quite frequently). 

The above problems and issues are more political and legacy, rather than of a technical nature. However any 
solution to information integration and management, such as the AIMS must be developed bearing such factors in 
mind. To the extent that is feasible, the AIMS team will endeavor to recommend solutions (technical or otherwise) 
to the above such problems when providing integrated access to information in the NAS domain. 

One step the AIMS project will take in the next quarter to manage these requirement issues is to develop a 
requirements trade space matrix for AIMS and use this matrix to factor out requirements and constraints in the 
development of agent based DSS tools for SWIM. The scope of this effort, which could be a project in itself, will be 
subject to limited task resource constraints at hand, but will serve as a model to base subsequent TSWIM projects 
for deploying agent based information synthesis aids in the NAS and FAA domains. 


(g) Conclusions 
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